System and method for classifying a disease state using representative data sets

ABSTRACT

System and method for determining a disease state of a sample. A sample is positioned in a field of view and a first spectroscopic data set is obtained. The positional information is stored and the sample is treated with a contrast enhancing agent. The sample is repositioned in the field of view and a digital image is obtained. The spectroscopic data is linked with the digital image and a database comprising representative spectroscopic data sets is searched to classify the disease state of the sample. The disclosure also provides for the step of obtaining a processed derivative image and searching a database comprising representative processed derivative images to classify a disease state of the sample.

RELATED APPLICATIONS

This application is a continuation-in-part of U.S. application Ser. No. 12/329,688, entitled “Method for Correlating Spectroscopic Measurements with Digital Images of Contrast Enhanced Tissue”, filed on Dec. 8, 2008, now U.S. Pat. No. 7,701,573, which is a continuation of U.S. application Ser. No. 11/527,839, now U.S. Pat. No. 7,477,378, entitled “Method for Correlating Spectroscopic Measurements with Digital Images of Contrast Enhanced Tissue,” filed on Sep. 27, 2006, which itself claims the benefit of U.S. Provisional Application No. 60/720,709, filed Sep. 27, 2005 entitled “Method for Correlating Raman Measurements with Digital Images of Stained Tissue.” All of these patents and applications are hereby incorporated by reference in their entireties.

FIELD OF DISCLOSURE

The present invention relates generally to a method and system to use spectroscopic measurements to classify a disease state through a correlation of spectroscopic measurements and digital images. More specifically, the present invention relates to classifying a disease state of a sample using representative data sets wherein each representative data set is characteristic of a disease class.

BACKGROUND

Spectroscopy and imaging have held promise for adding quantitative and objective analysis of tissue samples. However, the application of spectroscopic measurements to tissue analysis is limited by the inability to correlate the spectroscopic data with histopathology, which is evident in image data. This results from the interference of traditional contrasting agents with spectroscopic measurements. Therefore, there exists a need for a system and method that enables the correlation of spectroscopic data and histopathology. There also exists a need for more accurate and reliable systems and methods for analyzing such samples. The present disclosure describes an approach to overcome these limitations.

SUMMARY

The present disclosure provides for a system and method for classifying a disease state using representative data sets in a database. Each representative data set is characteristic of a disease class and comprises an analytically determined statistical representation of two or more members of a disease class. Therefore, such representative data sets represent an approximate average characteristic for members of a disease class, rather than a single data set representative of one individual of a disease class. This approach overcomes the limitations of the prior art because it provides for a more accurate assessment of a disease class of a sample and does not require linking of spectroscopic data to a specific digital image in a database.

The present disclosure provides for obtaining a spectroscopic data set of a sample and comparing this spectroscopic data set to representative spectroscopic data sets in a database wherein each representative spectroscopic data set is characteristic of a disease class. Based on this comparison, a disease state of the sample can be determined.

The present disclosure also provides for the determination of a disease class of a sample using a processed derivative image. This processed derivative image may be obtained by applying a chemometric technique to a spectroscopic data set obtained from a sample. This processed derivative image can then be compared to representative processed derivative images in a database to classify a disease state of the sample.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are included to provide further understanding of the disclosure and are incorporated in and constitute a part of this specification, illustrate embodiments of the disclosure and, together with the description, serve to explain the principles of the disclosure.

In the drawings:

FIG. 1 schematically represents an exemplary system of the present disclosure;

FIGS. 2A-2C illustrate the operation of an exemplary device used in the system of the present disclosure;

FIG. 3 is a flow chart illustrating an exemplary method of the present disclosure;

FIG. 4 is illustrative of a method of the present disclosure;

FIG. 5 is illustrative of a method of the present disclosure;

FIG. 6 is illustrative of a method of the present disclosure;

FIG. 7 shows a digital image of kidney tissue before treatment with a contrast enhancing agent obtained by an embodiment of the present disclosure;

FIGS. 8A-8C show spatially accurate wavelength resolved Raman images of kidney tissue;

FIGS. 9A-9B show spatially accurate wavelength resolved fluorescence images of kidney tissue;

FIG. 10 shows a digital image of kidney tissue after treatment with a contrast enhancing agent;

FIG. 11 shows Raman spectra for the corresponding regions of interest illustrated in FIG. 10;

FIG. 12 shows fluorescence spectra for the corresponding regions of interest illustrated in FIG. 10.

FIG. 13 shows an exemplary graphical user interface used to perform the method of the present disclosure.

DETAILED DESCRIPTION OF THE DISCLOSURE

Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts.

The present disclosure provides for a method to correlate spectroscopic measurements of samples with the spatial locations on digital images of contrast enhanced tissue. The correlation allows a user to classify the disease state of an unknown sample. Because treating a sample with a contrast enhancing agent typically interferes with spectroscopic measurements, spectroscopic data, for the unknown sample, are obtained prior to treating the unknown with the agent. The field of view of the spectroscopic measurement is stored so that the sample may be repositioned in the same field of view for later digital image measurements. The sample is then treated with the contrast enhancing agent and the unknown sample repositioned in the previously stored field of view. An image of the contrast enhanced sample is then obtained. The image of the contrast enhanced sample is linked to the spectroscopic measurement through a procedure of defining a mathematical translation of the relative spatial coordinates of the image of the contrast enhanced sample to the corresponding spatial coordinates of the spectroscopic measurements. The spatial coordinates of the digital image and the spatial coordinates of the spectroscopic measurements may be stored in a database. Because the two independent measurements are made on the same field of view, relative positions within the two datasets will correspond to the same location on the sample. By way of example, a single point halfway between the top and bottom and halfway between the left and right of the boundaries of the digital image of the contrast enhanced sample (at the center of the digital image) corresponds to the spectral measurement halfway between the top and bottom and halfway between the left and right of the boundaries of the set of spectroscopic measurements. 
By way of a second example, the upper right quadrant of the digital image of the contrast enhanced sample corresponds to the upper right quadrant of a wavelength resolved spectral image obtained in the set of spectroscopic measurements. This mathematical translation is in relative coordinates; thus, there is no requirement that both images have the same pixel size or shape.
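By way of illustration only, the relative-coordinate translation described above may be sketched as follows. The function name map_pixel and the use of pixel centers are assumptions made for this sketch and are not part of the disclosure; the sketch merely shows that two images of different pixel dimensions can be related through fractional positions within a shared field of view.

```python
def map_pixel(row, col, src_shape, dst_shape):
    """Map a pixel of a source image to the pixel at the same relative
    position in a destination image covering the same field of view.

    src_shape and dst_shape are (rows, cols) tuples. The name and the
    pixel-center convention are illustrative assumptions.
    """
    src_rows, src_cols = src_shape
    dst_rows, dst_cols = dst_shape
    # Fractional position of the pixel center within the field of view.
    rel_y = (row + 0.5) / src_rows
    rel_x = (col + 0.5) / src_cols
    # Convert the fractional position to an index in the destination image.
    dst_row = min(int(rel_y * dst_rows), dst_rows - 1)
    dst_col = min(int(rel_x * dst_cols), dst_cols - 1)
    return dst_row, dst_col


# Center of a 1024 x 1280 digital image mapped onto a 128 x 160 spectral image.
center = map_pixel(512, 640, (1024, 1280), (128, 160))
```

Because only fractional positions are used, the two images need not share a pixel size or shape, consistent with the description above.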

Through this procedure, the spectroscopic measurements are effectively linked to the digital images of the contrast enhanced sample. The method allows a user to classify the disease state of an unknown sample, based on its spectroscopic data, by searching a database containing spectroscopic information for known samples with well characterized pathology. This search can be performed on selected regions of the spectral data set. The method enables the search to be focused on selected regions of the spectral data set, containing spatially accurate wavelength resolved images, where the selected regions are targeted through use of the digital image of the contrast enhanced sample which is linked to the spectral data as described above. By way of example, in a case where a field of view contains both epithelial tissue and stromal tissue, a more accurate search of the database of spectral information can be obtained by selecting a subset of the spectral data corresponding to the epithelial tissue to be searched against the database. This subset of the spectral data can be determined after the digital image of the contrast enhanced sample is linked to the spectral measurements. The subset of spectral data is determined by identifying the spatial coordinates for a region of interest on the digital image of the contrast enhanced sample (corresponding to the epithelium, for example) and making the mathematical translation to identify the corresponding region of interest in the spectral dataset. The database is searched for the spectral data corresponding to the subset of the spectral data defined by the mapping of the region of interest from the digital image of the contrast enhanced sample to the spectral data set. This subset is searched against the database for matches for the spectral character of the sample.
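As a non-limiting sketch of the region-of-interest selection described above, a boolean mask drawn on the digital image may be carried over to the spectral data set in relative coordinates and used to extract the corresponding spectra. The array layout (rows, columns, wavelength bands) and the function name roi_spectra are assumptions made for illustration.

```python
import numpy as np


def roi_spectra(cube, image_mask):
    """Extract the spectra lying inside a region of interest.

    cube       : array of shape (rows, cols, bands), the spectral data set
                 (layout assumed for this sketch).
    image_mask : boolean array of shape (img_rows, img_cols) marking the
                 region of interest on the digital image.
    Returns the selected spectra, shape (n_pixels, bands), and the mask
    expressed on the spectral pixel grid.
    """
    rows, cols, _ = cube.shape
    img_rows, img_cols = image_mask.shape
    # For each spectral pixel, find the digital-image pixel located at the
    # same relative position in the shared field of view.
    r_idx = ((np.arange(rows) + 0.5) / rows * img_rows).astype(int)
    c_idx = ((np.arange(cols) + 0.5) / cols * img_cols).astype(int)
    spectral_mask = image_mask[np.ix_(r_idx, c_idx)]
    return cube[spectral_mask], spectral_mask
```

The returned spectra would then form the subset that is searched against the database.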

A digital image of the sample prior to treatment with a contrast enhancing agent may also be obtained and stored. The digital image may be used if subtle positional differences are present between the images of the treated and untreated samples. In this embodiment, the digital image of the untreated sample provides the positioning of the regions of interest to resolve any discrepancy.

As described above, the image linking scheme is a tool for selecting which subset of the spectral image data for the selected region of interest is used for disease classification through searching the database. This is described above for a manual approach to select a region of interest on the digital image which is linked to the spectral image. Automated approaches based on image segmenting could equally be applied to select a region of interest using a digital image associated with a spectral image. For instance an automated algorithm for determining which regions of a digital image correspond to the nuclei of cells could be used to select the subset of the spectral image which is compared to the database. Moreover, there is no restriction that a subset of the spectral dataset is contiguous spatially.
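One simple automated approach, offered only as a sketch, is an intensity threshold on a grayscale version of the digital image. The sketch assumes that the features of interest (for example, stained nuclei) appear darker than their surroundings and that the digital image has already been resampled to the pixel grid of the spectral image; the function names and the default threshold are hypothetical. The resulting mask need not be spatially contiguous.

```python
import numpy as np


def dark_region_mask(gray_image, threshold=None):
    """Return a boolean mask of pixels darker than a threshold.

    Assumption for this sketch: the features of interest are darker than
    the background. The default threshold (the image mean) is hypothetical.
    """
    gray = np.asarray(gray_image, dtype=float)
    if threshold is None:
        threshold = gray.mean()
    return gray < threshold


def mean_spectrum_in_mask(cube, mask):
    """Average the spectra of all masked pixels, contiguous or not.

    cube has shape (rows, cols, bands) and mask has shape (rows, cols).
    """
    cube = np.asarray(cube, dtype=float)
    if cube.shape[:2] != mask.shape:
        raise ValueError("mask must match the spatial shape of the cube")
    return cube[mask].mean(axis=0)
```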

FIG. 1 schematically represents an exemplary system 100 used to perform the methods of the present disclosure. System 100 includes, in a single platform, an imaging device in the form of a microscope objective 106, a spectroscopic device in the form of an imaging spectrometer 117 or a dispersive spectrometer 121, a processor 127, a database 125 and a microscope stage 103. System 100 further includes laser light source 107, white light source 105, and bandpass filter 109 to remove SiO₂ bands arising from a laser excitation fiber optic. The laser light is directed to a band reject optical filter 110 and propagated through an imaging objective 106 to illuminate the sample 101. Objective 106 collects photons emanating from the sample 101. Notch filters 112 and 113 reject light at the laser wavelength.

Though the discussion herein focuses on the system illustrated in FIG. 1, the practice of the method of this disclosure is not limited to such a system. An alternative system with the ability to deliver digital images and spectroscopic data sets is described in U.S. Pat. No. 7,046,359 entitled “System and Method for Dynamic Chemical Imaging” which is incorporated herein by reference in its entirety.

Sample 101 is an unknown sample for which a user would like to classify its disease state. Sample 101 may include a variety of samples such as tissue, tissue microarray, protein microarray, DNA microarray, and western blot. In one embodiment, sample 101 includes tissue. In another embodiment, the tissue includes kidney tissue, prostate tissue, lung tissue, colon tissue, bone marrow tissue, brain tissue, red blood tissue, breast tissue and cardiac muscle tissue.

FIGS. 2A-2C illustrate sample 101 supported on a substrate 204 which is positioned on an exemplary XYZ translational microscope stage 103. Microscope stage 103 includes a movable stage such as an automated XYZ translational microscope stage 103 which functions to position the sample 101 in field of view 210 of the collection optics of spectroscopic devices 117 or 121. Sample 101 is positioned in the field of view 210 of spectroscopic device 117, FIG. 2A. Imaging device 106 and spectroscopic devices 117 and 121 are aligned to have the same field of view 210. The positional information for the field of view 210 is stored for later reference. In one embodiment, the positional information is the center of the field of view relative to some origin fixed on the sample holder. Spectroscopic devices 117 and 121 are used to obtain a spectroscopic data set for sample 101 positioned in the field of view 210. In one embodiment, spectroscopic devices 117 or 121 are used to obtain Raman data sets for a tissue sample and imaging device 106 is used to obtain digital images of the tissue sample. In another embodiment, spectroscopic devices 117 or 121 are used to obtain fluorescence datasets for a sample and imaging device 106 is used to obtain digital images of the tissue sample. Sample 101 is then moved from the field of view 210 using the XYZ translational microscope stage, as shown in FIG. 2B. While sample 101 is positioned outside the field of view 210, sample 101 is treated with a contrast enhancing agent. The contrast enhancing agent includes a stain, a haematoxylin and eosin stain, phosphotungstic acid haematoxylin, silver nitrate, silver metal, gold ions, gold metal, osmium (VIII) oxide and immunohistochemically targeted fluorescent stains. In one embodiment, the contrast enhancing agent includes a haematoxylin or eosin stain.
The treated sample 215 is repositioned within the field of view 210 of spectroscopic devices 117 and 121 using the stored positional information, FIG. 2C. The imaging device 106 is used to obtain a digital image of the treated sample 215. In one embodiment, imaging device 106 is used to obtain a digital image of a tissue sample treated with a haematoxylin stain. By these steps, a user is able to obtain spectroscopic data of a sample before treatment with the contrast enhancing agent and digital images of the sample after treatment with the contrast enhancing agent. The spectroscopic data is obtained from the same spatial locations as observed in the digital images by storing the positional information of the field of view of spectroscopic devices 117 and 121.
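The storing of positional information and the later check that the treated sample has been returned to the same field of view may be sketched as follows. The class name StagePosition, the use of micrometer units, and the 1 micrometer tolerance are hypothetical choices made for this sketch; the disclosure does not specify a tolerance or a stage interface.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class StagePosition:
    """Center of the field of view relative to an origin fixed on the
    sample holder, in micrometers (units assumed for this sketch)."""
    x_um: float
    y_um: float
    z_um: float


def repositioning_error(stored, current):
    """Euclidean distance between the stored and current stage positions."""
    return math.dist(
        (stored.x_um, stored.y_um, stored.z_um),
        (current.x_um, current.y_um, current.z_um),
    )


def is_repositioned(stored, current, tolerance_um=1.0):
    """True when the stage is back within a (hypothetical) tolerance."""
    return repositioning_error(stored, current) <= tolerance_um
```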

To obtain a digital image, sample 101 is illuminated using a broad band light source 105, as illustrated in FIG. 1. In one embodiment, the white light source 105 is located under the sample 101 when system 100 operates in transmittance image mode. In a second embodiment, the white light source 105 is located above the sample 101 when system 100 operates in reflectance image mode. The transmitted or reflected light from the sample 101, positioned on the XYZ translational microscope stage 103, is collected using microscope objective 106. In one embodiment, microscope objective 106 includes an infinity-corrected microscope objective. The resulting digital image is detected by a CCD detector (not shown) and stored in database 125. In one embodiment, the database may comprise at least one representative data set. In another embodiment, said representative data set may comprise at least one of a representative spectroscopic data set and a representative processed derivative image data set.

Sample 101 is also illuminated with a laser light source 107. Light source 107 can include any conventional photon source, including laser, LED, and other IR or near IR devices. Light source 107 may also be selected to provide evanescence illumination of the sample. In one embodiment, the line width of the laser light source 107 is in the range of about 15-25 cm⁻¹. In another embodiment, laser epi-illumination is provided by light source 107, such as a Spectra Physics Millenia II Nd:YVO₄ laser beamed directly into the microscope optic. The monochromatic light reaching sample 101 illuminates the sample and photons are either scattered or emitted from different locations on or within the sample. The term emitted includes a wide range of optical processes including fluorescence, phosphorescence, photoluminescence, electroluminescence, chemiluminescence, sonoluminescence, thermoluminescence and even upconversion. Emitted photons or Raman scattered photons are collected by microscope objective 106 and directed to spectrometer 121 or imaging spectrometer 117. In another embodiment, illumination of the sample may produce photons absorbed or reflected by the sample.

Spectrometer 121 and imaging spectrometer 117 function to produce spectroscopic data sets of sample 101. A spectroscopic data set includes one or more of the following: a plurality of spectra and a plurality of spatially accurate wavelength resolved spectroscopic images. In one embodiment, the plurality of spectra may comprise at least one of: a plurality of Raman spectra, a plurality of fluorescence spectra, a plurality of infrared spectra, a plurality of near infrared spectra, a plurality of short wave infrared spectra, a plurality of mid infrared spectra, a plurality of ultraviolet spectra, a plurality of visible spectra, and combinations thereof. In another embodiment, the plurality of spatially-accurate wavelength resolved images may comprise at least one of: a spatially-accurate wavelength resolved Raman image, a spatially-accurate wavelength resolved fluorescence image, a spatially-accurate wavelength resolved infrared image, a spatially-accurate wavelength resolved near infrared image, a spatially-accurate wavelength resolved short wave infrared image, a spatially-accurate wavelength resolved mid infrared image, a spatially-accurate wavelength resolved ultraviolet image, a spatially-accurate wavelength resolved visible image, and combinations thereof. In yet another embodiment, the plurality of spectra includes a plurality of transmittance spectra and the plurality of spatially accurate wavelength resolved spectroscopic images include a plurality of spatially accurate wavelength resolved transmittance images.

The spectroscopic data set may contain spectroscopic subsets where the subset includes a plurality of spectra for the region of interest selected from the digital image. The plurality of spectra are obtained using dispersive spectrometer 121. A swing away mirror 115 is placed before filter 117 to redirect the emitted or Raman scattered photons to a fiber-optic 118. The other end of fiber-optic 118 is configured in a linear geometry and is focused on the entrance slit of a dispersive spectrometer 121. The plurality of spectra are detected by CCD detector 123 located at the exit focal plane of the spectrometer 121.

Referring still to FIG. 1, an imaging spectrometer 117 is used to generate the plurality of spatially accurate wavelength resolved spectroscopic images. The imaging spectrometer includes a two-dimensional tunable filter, such as electro-optical tunable filters, liquid crystal tunable filter (“LCTF”) or acousto-optical tunable filter (“AOTF”).

In one embodiment, the filter may comprise a multi-conjugate tunable filter (“MCF”). In one embodiment, the system and method of the present disclosure may comprise multi-conjugate filter technology available from ChemImage Corporation, Pittsburgh, Pa. This technology is more fully described in U.S. Pat. No. 6,992,809 entitled “Multi-Conjugate Liquid Crystal Tunable Filter,” filed on Feb. 2, 2005, and U.S. Pat. No. 7,362,489, also entitled “Multi-Conjugate Liquid Crystal Tunable Filter,” filed on Apr. 22, 2005. These patents are hereby incorporated by reference in their entireties.

The electro-optical filter (interchangeably, tunable filters) sequentially passes the absorbed, reflected, emitted or Raman scattered photons in each of a plurality of predetermined wavelength bands. The plurality of predetermined wavelength bands include specific wavelengths or ranges of wavelengths. In one embodiment, the predetermined wavelength bands include wavelengths characteristic of the sample undergoing analysis. The wavelengths that can be passed through tunable filter 140 may range from 200 nm (ultraviolet) to 2000 nm (i.e., the far infrared). The choice of tunable filter depends on the desired optical region and/or the nature of the sample being analyzed. The two-dimensional tunable filter includes a Fabry Perot angle tuned filter, an acousto-optic tunable filter, a liquid crystal tunable filter, a Lyot filter, an Evans split element liquid crystal tunable filter, a Solc liquid crystal tunable filter, a spectral diversity filter, a photonic crystal filter, a fixed wavelength Fabry Perot tunable filter, an air-tuned Fabry Perot tunable filter, a mechanically-tuned Fabry Perot tunable filter, a liquid crystal Fabry Perot tunable filter, and a multi-conjugate tunable filter. The tunable filter is selected to operate in one or more of the following spectral ranges: the ultraviolet (UV), visible, near infrared, and mid-infrared.

The plurality of spectra are detected by detector 123 and the plurality of spatially accurate wavelength resolved spectroscopic images are detected by detector 119. Detector 119 detects, in a spatially accurate manner, the emitted, absorbed, reflected, or Raman scattered or transmitted photons passed by imaging spectrometer 117. Detectors 119 and 123 may include a digital device such as, for example, an image focal plane array (“FPA”) or CCD or CMOS sensor. The optical region employed to characterize the sample of interest governs the choice of two-dimensional array detector. For example, a two-dimensional array of silicon charge-coupled device (“CCD”) detection elements can be employed with visible wavelength emitted or Raman scattered photons, while gallium arsenide (GaAs) and gallium indium arsenide (GaInAs) FPA detectors can be employed for image analyses at near infrared wavelengths. The choice of such devices depends on the type of sample being analyzed.

The spectroscopic data set and the digital image of the sample 101 are stored in database 125, shown in FIG. 1. For sample 101, its spectroscopic data set may be linked with the digital image of the sample 101. In one embodiment, Raman spectroscopic data for a sample is linked to a digital image of the sample treated with haematoxylin stain and/or eosin. As was discussed above, the digital image and the spectroscopic data set are linked through a transformation. The digital image of the treated sample may be characterized by a plurality of spatial coordinates. These spatial coordinates describe the x and y positions of the various features observed in the digital image. The spatially accurate wavelength resolved images that are part of the spectroscopic data sets are also characterized by a plurality of spatial spectral coordinates. The digital image and the spectroscopic data set are then linked through a transformation that maps the plurality of spatial coordinates of the digital image to the corresponding plurality of spatial coordinates of the spectroscopic data set for sample 101.

In one embodiment, the database 125 may store a plurality of spectroscopic data sets and digital images for known samples. The known samples have well characterized pathology of various disease conditions made through pathological examination of the digital images. The disease conditions include cancer, infection, stroke, ischemia, metabolic disorder, autoimmune disorders and heart attack. In another embodiment, the database 125 comprises at least one representative data set wherein each said representative data set is characteristic of a disease class. In one embodiment, the representative data set comprises at least one representative spectroscopic data set wherein each representative spectroscopic data set is characteristic of a disease class. In another embodiment, the representative data set may comprise at least one representative processed derivative image data set, wherein each processed derivative image data set is characteristic of a disease class. In such embodiments, “characteristic of disease class” may refer to the fact that each representative data set comprises an analytically derived statistical representation of two or more members of a disease class (i.e., average, mean, median, mode, etc.).
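A representative data set of the kind described above may be sketched, by way of example only, as a band-by-band statistic computed over the spectra of two or more members of a disease class. The function name representative_spectrum and the restriction to mean and median are illustrative choices for this sketch.

```python
import numpy as np


def representative_spectrum(member_spectra, statistic="mean"):
    """Combine the spectra of two or more members of a disease class
    into a single representative spectrum.

    member_spectra : array-like of shape (n_members, bands).
    statistic      : "mean" or "median" (choices limited for this sketch).
    """
    spectra = np.asarray(member_spectra, dtype=float)
    if spectra.ndim != 2 or spectra.shape[0] < 2:
        raise ValueError("two or more member spectra are required")
    if statistic == "mean":
        return spectra.mean(axis=0)
    if statistic == "median":
        return np.median(spectra, axis=0)
    raise ValueError(f"unsupported statistic: {statistic}")
```

In this sketch, the median is less sensitive than the mean to a single atypical member, which is one reason a statistical representation may be preferred over any one individual data set.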

To determine the spectroscopic data set or subset of sample 101 for analysis, the spatial coordinates of a region of interest are identified from the digital image of the treated sample. A corresponding region of interest is then identified for the spectroscopic data set or subset based on the transformation discussed above. The spectroscopic data set or subset includes one or more spatially accurate wavelength resolved spectroscopic images.

Processor 127 is configured to execute a machine readable program code 129 to search the database 125. For the spectroscopic data set or subset of the sample 101 under analysis, the database is searched to identify a spectroscopic data set, for a known sample having well characterized pathology, matching the spectroscopic data set of the sample 101. In one embodiment, database 125 is searched for a Raman data set for a known sample that matches the Raman spectrum of a tissue sample from a subject which is suspected of having a disease. The database can be searched using a variety of similarity metrics. The metrics include Euclidean Distance, the Spectral Angle Mapper (SAM), the Spectral Information Divergence (SID), Mahalanobis distance metric and spectral unmixing. A spectral unmixing metric is disclosed in U.S. Pat. No. 7,072,770 B1 entitled “Method for Identifying Components of a Mixture via Spectral Analysis,” which is incorporated herein by reference in its entirety.
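Three of the similarity metrics named above may be sketched as follows, using their commonly published definitions. The function names, and the small constant eps added to avoid taking the logarithm of zero, are assumptions of this sketch; the SID sketch further assumes non-negative spectra with a positive sum. For each metric, a smaller value indicates a closer match.

```python
import numpy as np


def euclidean_distance(a, b):
    """Euclidean distance between two spectra of equal length."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(np.linalg.norm(a - b))


def spectral_angle(a, b):
    """Spectral Angle Mapper: angle in radians between two spectra
    treated as vectors; insensitive to overall intensity scaling."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))


def spectral_information_divergence(a, b, eps=1e-12):
    """Spectral Information Divergence: symmetric relative entropy between
    the spectra after each is normalized to sum to one."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    p = a / a.sum() + eps
    q = b / b.sum() + eps
    return float(np.sum(p * np.log(p / q)) + np.sum(q * np.log(q / p)))
```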

The use of Raman spectroscopy to detect diseases is disclosed in the following: U.S. patent application Ser. No. 11/269,596 entitled “Cytological Methods for Detecting Disease Conditions Such as Malignancy by Raman Spectroscopic Imaging,” filed Nov. 9, 2005; and U.S. patent application Ser. No. 11/000,545, filed Nov. 20, 2004, entitled “Raman Molecular Imaging for Detection of Bladder Cancer,” which are incorporated by reference herein in their entirety. In one embodiment, the database is searched to determine if the tissue sample is indicative of bladder cancer by the sample's Raman spectra data sets. Cancerous bladder cells exhibit significant Raman scattering at an RS value of about 1584 cm⁻¹, relative to non-cancerous bladder cells. The intensity of Raman scattering at this RS value increases with increasing grade of bladder cancer. Other RS values at which Raman scattering is associated with the cancerous state of bladder cells include about 1000, 1100, 1250, 1370, and 2900 cm⁻¹. Furthermore, there is a generalized increase in Raman scattering at RS values in the range from about 1000 to 1650 cm⁻¹ and in the range from about 2750 to 3200 cm⁻¹ in bladder cancer cells, relative to non-cancerous bladder cells, and this generalized increase is more pronounced in the range of RS values from about 1530 to 1650 cm⁻¹. These RS values and ranges are useful for assessing the cancerous state of bladder cells.

Processor 127 is also configured to execute machine readable program code containing executable program instructions to perform a variety of functions. These functions are illustrated in FIG. 3 which shows a flow chart for a method of the present disclosure. In step 310, a first spectroscopic data set for a sample positioned in a field of view of a spectroscopic device is obtained and stored in a database. In step 320, the positional information about the field of view is stored in a database. In step 330, the repositioning of the contrast enhancing treated sample in the field of view of the spectroscopic device is monitored using the stored positional information about the field of view. In step 340, a digital image of the treated sample positioned in the field of view is obtained and stored in the database. The field of view is the same field of view as that in step 310. In step 350, the database is searched to identify a second spectroscopic data set matching the first spectroscopic data set or a subset of the spectroscopic dataset chosen using the linked digital image as a guide. The second spectroscopic data set is for a known sample having well characterized pathology.

In another embodiment, the present disclosure provides for a system comprising: a spectroscopic device, an imaging device, a database having a plurality of representative spectroscopic datasets, wherein each representative spectroscopic data set is characteristic of a disease class, a machine readable program code containing executable program instructions, and a processor operatively coupled to said spectroscopic device and said imaging device configured to execute said machine readable program code so as to perform the following: using said spectroscopic device, obtain a first spectroscopic data set for a sample positioned in a field of view, store positional information about said field of view, after the sample is treated with a contrast enhancing agent, monitor the repositioning of the treated sample in said field of view of the spectroscopic device using said stored positional information about said field of view, using said spectroscopic device, obtain a digital image of the treated sample positioning in said field of view linking said first spectroscopic data set with said digital image; and classifying a disease state of said sample, wherein said classification comprises: for said first spectroscopic data set, searching said database to thereby identify a representative spectroscopic data set therein that matches said first spectroscopic dataset to thereby classify a disease state of said sample.

In another embodiment, the present disclosure provides for a system comprising: a spectroscopic device; an imaging device; a database having a plurality of representative processed derivative image data sets, wherein each representative processed derivative image data set is characteristic of a disease class; a machine readable program code containing executable program instructions; and a processor operatively coupled to said spectroscopic device and said imaging device configured to execute said machine readable program code so as to perform the following: using said spectroscopic device, obtain a first spectroscopic data set for a sample positioned in a field of view, obtain a first processed derivative image from said spectroscopic data set, store positional information about said field of view, after the sample is treated with a contrast enhancing agent, monitor the repositioning of the treated sample in said field of view, using said spectroscopic device, obtain a digital image of the treated sample positioning in said field of view, linking said first processed derivative image of said sample with said digital image; and classifying a disease state of said sample, wherein said classification comprises: for said first processed derivative image, searching said database to thereby identify a representative processed derivative image data set therein that matches said first processed derivative image to thereby classify a disease state of said sample.
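By way of a hedged example, a processed derivative image may be sketched as a principal component score image computed from the spectroscopic data set. Principal component analysis is used here only as one familiar chemometric technique and is an assumption of this sketch; the passage above does not identify a particular technique. The function name pca_score_image and the (rows, columns, bands) layout are likewise illustrative.

```python
import numpy as np


def pca_score_image(cube, component=0):
    """Compute a principal component score image from a spectral cube.

    cube      : array of shape (rows, cols, bands) (layout assumed).
    component : index of the principal component to display.
    Returns an image of shape (rows, cols) holding each pixel's score.
    """
    rows, cols, bands = cube.shape
    # Treat every pixel as one observation and every band as one variable.
    pixels = cube.reshape(rows * cols, bands).astype(float)
    pixels -= pixels.mean(axis=0)
    # Singular value decomposition of the mean-centered data; the scores
    # for component k are U[:, k] * S[k].
    u, s, _ = np.linalg.svd(pixels, full_matrices=False)
    scores = u[:, component] * s[component]
    return scores.reshape(rows, cols)
```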

In one embodiment of a system of the present disclosure, said first spectroscopic data set comprises at least one of: a spectrum and a spatially-accurate wavelength resolved image. In one embodiment, said first spectroscopic data set is obtained using a spectroscopic technique selected from the group consisting of: Raman, infrared, short wave infrared, near infrared, mid infrared, ultraviolet, fluorescence, visible, and combinations thereof.

In one embodiment, the system may also comprise a filter. The filter may be a tunable filter. In one embodiment, the tunable filter may comprise a multi-conjugate tunable filter. In another embodiment, the filter may comprise a liquid crystal tunable filter.

The present disclosure also provides for methods for determining a disease state of a sample using a database comprising representative data sets, wherein each representative data set is characteristic of a disease class. That is, each representative data set comprises an analytically derived statistical representation of at least two members of a disease class. This statistical representation may be an average. In another embodiment, the statistical representation may be a mean, a median, or a mode. Using representative data sets holds potential for analysis of samples because a representative data set more accurately reflects a typical member of a disease class. Since the representative data set represents two or more members of a disease class, it is less likely that one outlier data set would prevent an accurate classification of a sample. The method also holds potential for analysis of samples because it does not require linking of spectroscopic data in a database to a specific digital image of one member of a class.
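By way of a non-limiting illustration, the construction of a representative data set may be sketched as follows. The function name `representative_spectrum` and the intensity values are hypothetical and are not taken from the disclosure; the sketch assumes that all member spectra are sampled on the same spectral axis.

```python
from statistics import mean, median


def representative_spectrum(member_spectra, statistic=mean):
    """Collapse spectra from several class members into one representative spectrum."""
    if len(member_spectra) < 2:
        raise ValueError("a representative data set needs at least two members")
    n_points = len(member_spectra[0])
    if any(len(s) != n_points for s in member_spectra):
        raise ValueError("all spectra must share the same spectral axis")
    # zip(*...) walks the member spectra one spectral channel at a time.
    return [statistic(channel) for channel in zip(*member_spectra)]


# Hypothetical intensities for three members of one disease class.
# The third member carries an outlier in its second channel.
members = [
    [1.0, 2.0, 3.0],
    [1.2, 2.2, 2.8],
    [0.8, 9.0, 3.2],
]
mean_rep = representative_spectrum(members, mean)
median_rep = representative_spectrum(members, median)
```

In this sketch the median-based representation is less affected by the outlier channel than the mean-based one, which is one reason a particular statistic might be chosen over another.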

FIG. 4 is illustrative of a method of the present disclosure. The method 400 comprises positioning a sample in a field of view of a spectroscopic device in step 410. In step 420 a first spectroscopic data set is obtained for the sample positioned in said field of view. Positional information about said field of view is stored in step 430. The sample is treated with a contrast enhancing agent in step 440. In step 450 the treated sample is repositioned in said field of view of the spectroscopic device using said stored positional information about said field of view. A digital image of the treated sample positioned in said field of view is obtained in step 460. In step 470 said first spectroscopic data set is linked with said digital image. In step 480 a disease state of the sample is classified, wherein said classification comprises providing a database having at least one representative spectroscopic data set wherein said representative spectroscopic data set is characteristic of a disease class, and for said first spectroscopic data set, searching said database to thereby identify a representative spectroscopic data set therein that matches said first spectroscopic data set to thereby classify a disease state of said sample.
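As a minimal sketch only, the storing, repositioning, and linking of steps 430, 450, and 470 might be modeled as shown below. The disclosure does not specify how positional information is encoded; the stage coordinates in micrometres, the tolerance value, and the names `FieldOfView`, `is_repositioned`, and `link` are all assumptions made for illustration.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class FieldOfView:
    """Hypothetical stage coordinates (micrometres) identifying a field of view."""
    x_um: float
    y_um: float


def is_repositioned(stored, current, tolerance_um=1.0):
    """True when the current stage position lies within tolerance of the stored one."""
    offset = hypot(current.x_um - stored.x_um, current.y_um - stored.y_um)
    return offset <= tolerance_um


def link(spectral_data, digital_image, stored, current, tolerance_um=1.0):
    """Pair pre-treatment spectral data with the post-treatment digital image."""
    if not is_repositioned(stored, current, tolerance_um):
        raise ValueError("treated sample is not back in the stored field of view")
    return {
        "field_of_view": stored,
        "spectral_data": spectral_data,
        "digital_image": digital_image,
    }


stored_fov = FieldOfView(x_um=120.0, y_um=340.0)      # saved before staining
current_fov = FieldOfView(x_um=120.4, y_um=339.7)     # read back after staining
record = link("raman_cube", "stained_image", stored_fov, current_fov)
```

The placeholder strings stand in for the actual spectroscopic data set and digital image; the point of the sketch is only that the link is made once the treated sample is confirmed to be back in the stored field of view.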

In one embodiment, said representative spectroscopic data set comprises an analytically determined statistical representation of spectroscopic data for two or more members of a disease state class. In one embodiment, said statistical representation comprises at least one of: a mean, a median, a mode, and combinations thereof. The present disclosure also contemplates that other statistical approaches may be used.

In one embodiment, the first spectroscopic data set may comprise at least one of a spectrum and a spatially-accurate wavelength resolved image. Said first spectroscopic data set may be obtained using a spectroscopic technique such as Raman, infrared, near infrared, mid infrared, short wave infrared, ultraviolet, fluorescence, visible, and combinations thereof.

FIG. 5 is illustrative of another method of the present disclosure. In such an embodiment, the method 500 comprises positioning a sample in a field of view of a spectroscopic device in step 510. In step 520 a first spectroscopic data set is obtained for the sample positioned in said field of view. In step 530 a first processed derivative image is generated from said first spectroscopic data set. In step 540 positional information is stored about said field of view. The sample is treated with a contrast enhancing agent in step 550. In step 560 the treated sample is repositioned in said field of view of the spectroscopic device using said stored positional information about said field of view. A digital image is obtained of the treated sample positioned in said field of view in step 570. In step 580, said first processed derivative image is linked with said digital image and a disease state of the sample is classified.

In one embodiment, the processed derivative image is obtained by applying a chemometric technique. The technique may be selected from the group consisting of: principal component analysis (PCA), Partial Least Squares Discriminant Analysis (PLSDA), Cosine Correlation Analysis (CCA), Euclidean Distance Analysis (EDA), k-means clustering, multivariate curve resolution (MCR), Band-Target Entropy Minimization (BTEM), Mahalanobis Distance (MD), Adaptive Subspace Detector (ASD), and combinations thereof. Said representative processed derivative image data set may comprise, in one embodiment, an analytically derived statistical representation of spectroscopic data for a group of members of a disease class. This statistical representation may be any known in the art, including one selected from the group consisting of: a mean, a median, a mode, and combinations thereof.
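For illustration, one of the listed techniques, cosine correlation analysis, may be sketched as follows. The sketch assumes the spectroscopic data set is a hypercube indexed as `hypercube[row][col]`, with each entry holding the spectrum at that pixel, and that a reference spectrum is available; the function names and values are hypothetical. Each pixel of the resulting image holds the cosine of the angle between the pixel spectrum and the reference spectrum.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine of the angle between two spectra treated as vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    # An all-zero spectrum has no direction; score it as 0.0 by convention here.
    return dot / norm if norm else 0.0


def cosine_correlation_image(hypercube, reference):
    """Return a 2-D score image with one cosine score per pixel."""
    return [[cosine_similarity(pixel, reference) for pixel in row]
            for row in hypercube]


# Hypothetical 2 x 2 pixel hypercube with two spectral channels per pixel.
hypercube = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [1.0, 1.0]],
]
score_image = cosine_correlation_image(hypercube, reference=[1.0, 0.0])
```

Because the cosine score depends on spectral shape rather than overall intensity, the two pixels whose spectra are scaled copies of the reference receive the same score in this sketch.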

FIG. 6 is illustrative of yet another method of the present disclosure. In such an embodiment, the method 600 comprises positioning a sample in a field of view of a spectroscopic device in step 610. In step 620 a first spectroscopic data set is obtained for the sample positioned in said field of view. In step 630 a first processed derivative image is generated from said first spectroscopic data set. In step 640 positional information is stored about said field of view. The sample is treated with a contrast enhancing agent in step 650. In step 660 the treated sample is repositioned in said field of view of the spectroscopic device using said stored positional information about said field of view. A digital image is obtained of the treated sample positioned in said field of view in step 670. In step 680, said first processed derivative image is linked with said digital image and a disease state of the sample is classified. In the embodiment of the method in FIG. 6, the classifying comprises providing, in step 690, a database having a plurality of representative processed derivative image data sets, wherein each representative processed derivative image data set is characteristic of a disease state class. In step 695 said database is searched to thereby identify a representative processed derivative image data set therein that matches said first processed derivative image to thereby classify a disease state of said sample.

EXAMPLES

Example 1 illustrates a set of image and spectroscopic data for a thin section of kidney tissue mounted on an aluminum coated slide. FIG. 7 shows a digital image for kidney tissue which has not been treated with a contrast enhancing agent. FIGS. 8A-8C show a series of Raman images of the kidney tissue of FIG. 7. FIG. 8A shows a spatially accurate Raman image at 1450 cm⁻¹. FIG. 8B shows a spatially accurate Raman image at 1650 cm⁻¹. FIG. 8C shows a spatially accurate Raman image at 2930 cm⁻¹. FIGS. 9A-9B illustrate a series of fluorescence images of the kidney tissue of FIG. 7. FIG. 9A shows a spatially accurate fluorescence image at 515 nm. FIG. 9B shows a spatially accurate fluorescence image at 570 nm. FIG. 10 shows a digital image of the kidney tissue after the tissue was stained with hematoxylin and eosin following standard staining procedures. FIG. 10 shows regions of interest 1010, 1020, 1030 and 1040 that are used to extract Raman and fluorescence data sets for searching. FIG. 11 illustrates a subset of Raman spectra for the regions of interest in FIG. 10 where the spectra were extracted from the corresponding regions of interest in the spectroscopic data set: Raman spectrum A corresponds to region of interest 1010; Raman spectrum B corresponds to region of interest 1020; Raman spectrum C corresponds to region of interest 1030; and Raman spectrum D corresponds to region of interest 1040. FIG. 12 illustrates fluorescence spectra for the regions of interest in FIG. 10 where the spectra were extracted from the corresponding regions of interest in the spectroscopic data set: fluorescence spectrum A corresponds to region of interest 1010; fluorescence spectrum B corresponds to region of interest 1020; fluorescence spectrum C corresponds to region of interest 1030; and fluorescence spectrum D corresponds to region of interest 1040.
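The extraction of a spectrum from a region of interest may be sketched as follows. This is an illustrative assumption rather than the procedure actually used in Example 1: the region of interest is represented as a list of (row, column) pixel coordinates, the extracted spectrum is taken as the channel-wise mean over those pixels, and the name `roi_mean_spectrum` and all values are hypothetical.

```python
def roi_mean_spectrum(hypercube, roi_pixels):
    """Average the spectra of the (row, col) pixels in a region of interest."""
    if not roi_pixels:
        raise ValueError("region of interest contains no pixels")
    spectra = [hypercube[row][col] for row, col in roi_pixels]
    count = len(spectra)
    # zip(*...) groups intensities by spectral channel across the selected pixels.
    return [sum(channel) / count for channel in zip(*spectra)]


# Hypothetical 2 x 2 pixel hypercube with three spectral channels per pixel.
hypercube = [
    [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]],
    [[5.0, 6.0, 7.0], [7.0, 8.0, 9.0]],
]
top_row_roi = [(0, 0), (0, 1)]
roi_spectrum = roi_mean_spectrum(hypercube, top_row_roi)
```

Because the spectroscopic data set and the digital image share the same field of view, pixel coordinates selected on the stained image can index the hypercube directly in this sketch.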

FIG. 13 shows a graphical user interface for an embodiment of a system used in performance of a method of the present disclosure. A digital image 1310 is shown for a piece of bladder tissue stained with hematoxylin and eosin following standard staining procedures. This digital image 1310 is linked to its Raman spectral data set. The region of interest 1320 highlighted in the digital image is used to select a subset of the spectral data set for searching. The spectral data set is indicated in the spectral portion 1330 of FIG. 13. A subset of the spectral image data set, in the form of a spectral trace, was searched against a database using Euclidean distance as a metric. The results of the search are shown in the left frame 1340. The correct classification of the region of interest is a nucleus from a bladder tumor. The disease classification of the samples in the database was obtained by pathological characterization. The spectral data search results, in the left frame 1340, returned nucleus from a bladder tumor as both of the top two ranked results.
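A ranked search using Euclidean distance as the metric may be sketched as follows. The database entries, class labels, and intensity values are hypothetical and do not reproduce the data of FIG. 13; the sketch assumes the query trace and every database spectrum share the same spectral axis.

```python
from math import dist


def rank_matches(query, database):
    """Return (label, distance) pairs ordered from closest to farthest."""
    scored = [(label, dist(query, spectrum)) for label, spectrum in database]
    return sorted(scored, key=lambda item: item[1])


# Hypothetical database of labeled spectra (labels assigned by pathology).
database = [
    ("bladder tumor nucleus", [0.9, 0.4, 0.1]),
    ("normal bladder nucleus", [0.2, 0.5, 0.8]),
    ("bladder tumor stroma", [0.5, 0.5, 0.5]),
]
query_trace = [0.85, 0.45, 0.15]   # spectral trace from the region of interest
ranking = rank_matches(query_trace, database)
best_label, best_distance = ranking[0]
```

The smallest distance identifies the closest entry, and the full ordering corresponds to a ranked result list of the kind displayed to the user.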

The present disclosure may be embodied in other specific forms without departing from the spirit or essential attributes of the disclosure. Accordingly, reference should be made to the appended claims, rather than the foregoing specification, as indicating the scope of the disclosure. Although the foregoing description is directed to the preferred embodiments of the disclosure, it is noted that other variations and modifications will be apparent to those skilled in the art, and may be made without departing from the spirit or scope of the disclosure.

1. A method comprising: positioning a sample in a field of view of a spectroscopic device; obtaining a first spectroscopic data set for the sample positioned in said field of view; storing positional information about said field of view; treating the sample with a contrast enhancing agent; repositioning the treated sample in said field of view of the spectroscopic device using said stored positional information about said field of view; obtaining a digital image of the treated sample positioned in said field of view; linking said first spectroscopic data set with said digital image; and classifying a disease state of said sample, wherein said classification comprises: providing a database having at least one representative spectroscopic data set wherein said representative spectroscopic data set is characteristic of a disease class, and for said first spectroscopic data set, searching said database to thereby identify a representative spectroscopic data set therein that matches said first spectroscopic dataset to thereby classify a disease state of said sample.
 2. The method of claim 1 wherein said representative spectroscopic data set comprises an analytically determined statistical representation of spectroscopic data for two or more members of a disease state class.
 3. The method of claim 2 wherein said statistical representation comprises at least one of: a mean, a median, a mode, and combinations thereof.
 4. The method of claim 1 wherein said first spectroscopic data set comprises at least one of: a spectrum and a spatially-accurate wavelength resolved image.
 5. The method of claim 1 wherein said first spectroscopic data set is obtained using a spectroscopic technique selected from the group consisting of: Raman, infrared, short wave infrared, near infrared, mid infrared, ultraviolet, fluorescence, visible, and combinations thereof.
 6. A method comprising: positioning a sample in a field of view of a spectroscopic device; obtaining a first spectroscopic data set for the sample positioned in said field of view; generating a first processed derivative image from said first spectroscopic data set; storing positional information about said field of view; treating the sample with a contrast enhancing agent; repositioning the treated sample in said field of view of the spectroscopic device using said stored positional information about said field of view; obtaining a digital image of the treated sample positioned in said field of view; and linking said first processed derivative image with said digital image and classifying a disease state of said sample.
 7. The method of claim 6 wherein said classifying further comprises: providing a database having a plurality of representative processed derivative image data sets, wherein each representative processed derivative image data set is characteristic of a disease class; and searching said database to thereby identify a representative processed derivative image data set therein that matches said first processed derivative image to thereby classify a disease state of said sample.
 8. The method of claim 6 wherein said first processed derivative image is obtained by applying a chemometric technique.
 9. The method of claim 8 wherein said chemometric technique is selected from the group consisting of: principal component analysis (PCA), Partial Least Squares Discriminant Analysis (PLSDA), Cosine Correlation Analysis (CCA), Euclidean Distance Analysis (EDA), k-means clustering, multivariate curve resolution (MCR), Band-Target Entropy Minimization (BTEM), Mahalanobis Distance (MD), Adaptive Subspace Detector (ASD), and combinations thereof.
 10. The method of claim 7 wherein said representative processed derivative image data set comprises an analytically derived statistical representation of spectroscopic data for two or more members of a disease class.
 11. The method of claim 10 wherein said statistical representation comprises at least one of: mean, median, mode, and combinations thereof.
 12. The method of claim 6 wherein said first spectroscopic data set comprises at least one of: a spectrum and a spatially-accurate wavelength resolved image.
 13. The method of claim 6 wherein said first spectroscopic data set is obtained using a spectroscopic technique selected from the group consisting of: Raman, infrared, short wave infrared, near infrared, mid infrared, ultraviolet, fluorescence, visible, and combinations thereof.
 14. A system comprising: a spectroscopic device; an imaging device; a database having a plurality of representative spectroscopic data sets, wherein each representative spectroscopic data set is characteristic of a disease class; a machine readable program code containing executable program instructions; and a processor operatively coupled to said spectroscopic device and said imaging device configured to execute said machine readable program code so as to perform the following: using said spectroscopic device, obtain a first spectroscopic data set for a sample positioned in a field of view, store positional information about said field of view, after the sample is treated with a contrast enhancing agent, monitor the repositioning of the treated sample in said field of view of the spectroscopic device using said stored positional information about said field of view, using said spectroscopic device, obtain a digital image of the treated sample positioned in said field of view, linking said first spectroscopic data set with said digital image; and classifying a disease state of said sample, wherein said classification comprises: for said first spectroscopic data set, searching said database to thereby identify a representative spectroscopic data set therein that matches said first spectroscopic data set to thereby classify a disease state of said sample.
 15. The system of claim 14 wherein said first spectroscopic data set comprises at least one of: a spectrum and a spatially-accurate wavelength resolved image.
 16. The system of claim 14 wherein said first spectroscopic data set is obtained using a spectroscopic technique selected from the group consisting of: Raman, infrared, short wave infrared, near infrared, mid infrared, ultraviolet, fluorescence, visible, and combinations thereof.
 17. The system of claim 14 further comprising a tunable filter.
 18. The system of claim 17 wherein said tunable filter comprises a multi-conjugate tunable filter.
 19. The system of claim 17 wherein said tunable filter comprises a liquid crystal tunable filter.
 20. A system comprising: a spectroscopic device; an imaging device; a database having a plurality of representative processed derivative image data sets, wherein each representative processed derivative image data set is characteristic of a disease class; a machine readable program code containing executable program instructions; and a processor operatively coupled to said spectroscopic device and said imaging device configured to execute said machine readable program code so as to perform the following: using said spectroscopic device, obtain a first spectroscopic data set for a sample positioned in a field of view, obtain a first processed derivative image from said first spectroscopic data set, store positional information about said field of view, after the sample is treated with a contrast enhancing agent, monitor the repositioning of the treated sample in said field of view, using said spectroscopic device, obtain a digital image of the treated sample positioned in said field of view, linking said first processed derivative image of said sample with said digital image; and classifying a disease state of said sample, wherein said classification comprises: for said first processed derivative image, searching said database to thereby identify a representative processed derivative image data set therein that matches said first processed derivative image to thereby classify a disease state of said sample.
 21. The system of claim 20 wherein said first spectroscopic data set comprises at least one of: a spectrum and a spatially-accurate wavelength resolved image.
 22. The system of claim 20 wherein said first spectroscopic data set is obtained using a spectroscopic technique selected from the group consisting of: Raman, infrared, short wave infrared, near infrared, mid infrared, ultraviolet, fluorescence, visible, and combinations thereof.
 23. The system of claim 20 further comprising a tunable filter.
 24. The system of claim 23 wherein said tunable filter comprises a multi-conjugate tunable filter.
 25. The system of claim 23 wherein said tunable filter comprises a liquid crystal tunable filter. 